Duration Modeling in Mandarin Connected Digit Recognition
نویسندگان
چکیده
Digit string recognition is required in many applications which need to recognize numbers such as telephone numbers, credit card numbers, date, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there exist two mono-phonemic digits and a heavily rhotacized vowel. In order to use duration information more efficiently, we propose a method to model context dependent word duration information and then incorporate it directly in the decoding algorithm. Experimental results show that this method reduces word error rate by as much as 32.1%.
منابع مشابه
Performance of Mandarin Connected Digit Recognizer with Word Duration Modeling
Digit string recognition is required in many applications such as automatic banking system, database information retrieving system, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there exist two mon...
متن کاملImprovement in Connected Mandarin Digit Recognition by Explicitly Modeling Coarticulatory Information
The most successful training scheme for recognition of connected spoken digits is the segmental k-means algorithm, which implicitly captures the coarticulatory information of connected speech iteratively to establish reliable reference patterns. However, when this algorithm is applied to Mandarin digits, the obtained performance is inferior to that of English. Hence, a novel approach is propose...
متن کاملAn Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition
Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The...
متن کاملUse of syllable center detection for improved duration modeling in Chinese Mandarin connected digits recognition
This paper describes practical approaches for improving Mandarin digit recognition accuracy, especially in cars. We consider syllable and subword unit durations as additional source of information. The explored approach was realized in two stages. First, the system performs standard speech recognition using acoustic spectral features. As a result, an n-best list of hypotheses is generated. In t...
متن کاملAutomatic speech recognition in Mandarin for embedded platforms
In this paper, we describe a real-time automatic speech recognition system for Mandarin for low-cost embedded platforms using fixed-point digital signal processors. The hands-free, speaker-independent speech recognition system employs 41 mono-phone models for representing the sounds in Mandarin Chinese and 11 whole-word models for connected digit recognition. The system achieves greater than 98...
متن کامل